Accurate Community Detection in the Stochastic Block Model via Spectral Algorithms

نویسندگان

  • SeYoung Yun
  • Alexandre Proutière
چکیده

We consider the problem of community detection in the Stochastic Block Model with a finite number K of communities of sizes linearly growing with the network size n. This model consists in a random graph such that each pair of vertices is connected independently with probability p within communities and q across communities. One observes a realization of this random graph, and the objective is to reconstruct the communities from this observation. We show that under spectral algorithms, the number of misclassified vertices does not exceed s with high probability as n grows large, whenever pn = ω(1), s = o(n) and lim inf n→∞ n(α1p+ α2q − (α1 + α2)p 1 α1+α2 q 2 α1+α2 ) log(n s ) > 1, (1) whereα1 and α2 denote the (fixed) proportions of vertices in the two smallest communities. In view of recent work by Abbe et al. (2014) and Mossel et al. (2014), this establishes that the proposed spectral algorithms are able to exactly recover communities whenever this is at all possible in the case of networks with two communities with equal sizes. We conjecture that condition (1) is actually necessary to obtain less than s misclassified vertices asymptotically, which would establish the optimality of spectral method in more general scenarios.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance of a community detection algorithm based on semidefinite programming

The problem of detecting communities in a graph is maybe one the most studied inference problems, given its simplicity and widespread diffusion among several disciplines. A very common benchmark for this problem is the stochastic block model or planted partition problem, where a phase transition takes place in the detection of the planted partition by changing the signal-to-noise ratio. Optimal...

متن کامل

A Tensor Spectral Approach to Learning Mixed Membership Community Models

Detecting hidden communities from observed interactions is a classical problem. Theoretical analysis of community detection has so far been mostly limited to models with non-overlapping communities such as the stochastic block model. In this paper, we provide guaranteed community detection for a family of probabilistic network models with overlapping communities, termed as the mixed membership ...

متن کامل

Analysis of spectral clustering algorithms for community detection: the general bipartite setting

We consider the analysis of spectral clustering algorithms for community detection under a stochastic block model (SBM). A general spectral clustering algorithm consists of three steps: (1) regularization of an appropriate adjacency or Laplacian matrix (2) a form of spectral truncation and (3) a k-means type algorithm in the reduced spectral domain. By varying each step, one can obtain differen...

متن کامل

A tensor approach to learning mixed membership community models

Community detection is the task of detecting hidden communities from observed interactions. Guaranteed community detection has so far been mostly limited to models with nonoverlapping communities such as the stochastic block model. In this paper, we remove this restriction, and provide guaranteed community detection for a family of probabilistic network models with overlapping communities, term...

متن کامل

Community detection and stochastic block models: recent developments

The stochastic block model (SBM) is a random graph model with planted clusters. It is widely employed as a canonical model to study clustering and community detection, and provides generally a fertile ground to study the statistical and computational tradeoffs that arise in network and data sciences. This note surveys the recent developments that establish the fundamental limits for community d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1412.7335  شماره 

صفحات  -

تاریخ انتشار 2014